Is the New Resonance Spin or 2? Taking a Step Forward in the Higgs Boson 

Discovery 
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The observation of a new boson of mass ~ 125 GeV at the CERN LHC may finally have revealed 
the existence of a Higgs boson. Now we have the opportunity to scrutinize its properties, determining 
its quantum numbers and couplings to the standard model particles, in order to confirm or not its 
discovery. We show that by the end of the 8 TeV run, combining the entire data sets of ATLAS 
and CMS, it will be possible to discriminate between the following discovery alternatives: a scalar 
J p = 0+ or a tensor J p — 2 + particle with minimal couplings to photons, at a 5a statistical 
confidence level at least, using only diphotons events. Our results are based on the calculation of a 
center-edge asymmetry measure of the reconstructed sPlot scattering polar angle of the diphotons. 
The results based on asymmetries are shown to be rather robust against systematic uncertainties 
with comparable discrimination power to a log likelihood ratio statistic. 



I. INTRODUCTION 

The CERN LHC has recently achieved one of its main 
goals - the discovery of a new particle whose known prop- 
erties, until this moment, points to a Higgs boson [l|, 0- 
In fact, as the new particle decays to pairs of gauge 
bosons and fermions, a non-integer spin is ruled out. 
Moreover, the new resonance decays into pairs of pho- 
tons which, accordingly to the Landau- Yang theorem [3| , 
discards the spin-1 possibility, leaving either a scalar or 
a tensor option for the new boson. 

Concerning the parity properties of the new boson, 
as a pseudo-scalar has no renormalizable interactions 
to SM particles, and given the lack of evidence of new 
physics at the electroweak scale, the J p = 0~ possibility 
is currently disfavored by collider data, noticeably the 
H — > WW branching ratio which is expected to be two 
orders of magnitude smaller than the observed one Q. 

The measured mass, production cross section, and 
branching ratios favor the Higgs boson hypothesis 
though. In fact, scalar impostors alternatives can also ex- 
plain the observed signal I5j while spin-2 alternatives like 
KK-gravitons from ADD 6] or Randall-Sundrum (RS) [7] 
models predict different decay patterns at the same time 
they are already very constrained by the LHC data Q. 
Nevertheless, a dedicated study of the spin of the new res- 
onance is a necessary step to ascertain its scalar or tenso- 
rial nature and, consequently, fulfill a necessary condition 
to claim the discovery of a Higgs boson. 

The possible enhancement in the diphoton channel has 
motivated several works of Standard Model (SM) ex- 
tensions capable to explain the experimental observa- 
tion [Toi a lthough a SM Higgs boson is by no means ruled 
out [(J [ll| by the present data. Regardless the absolute 
size of the branching ratio to photons, all the proposed 
SM extensions that involve a scalar particle predict an 
isotropically distributed yield at parton level. This is 
not the case for a typical spin-2 state which decays in a 
highly anisotropic way inside the detector 12j. 

Analyzing the shape of the angular variables of charged 



leptons, with multivariate likelihood techniques, proved 
to be able to discriminate between a scalar, a vector, or 
a tensor hypothesis for a favorable mass region in ZZ — > 
decays with sufficient data at an upgraded 10 
or 14 TeV LHC [THl|. 

More recently, and update of the ZZ channel, and a 
brand new analysis for spins in the WW and 77 channels 
were made for the 8 TeV LHC [lj| . Concerning the spin-0 
versus spin-2 discrimination, the main conclusion of this 
work was that around a 4<r confidence level in the CP- 
even spin-0 resonance hypothesis is expected for 35 fb _1 
at one LHC experiment, combining the ZZ, WW, and 
77 channels. 

Spin analyses for spin-2 RS KK-gravitons at the LHC, 
relying on asymmetries of angular distributions, were car- 
ried out in dilepton and diphoton channels based on x 2 - 
tests [l3| ■ As the branching ratio of these KK-gravitons 
into photons is much larger than the corresponding SM 
Higgs boson, the backgrounds are not a serious issue for 
spin discrimination in this case. 

The experimental fact, however, is that the 77 channel 
alone does not look promising given the tiny observed 
signal to background ratio of 0(1O~ 3 ) in the mass re- 
gion of 100 GeV- 160 GeV [E i]. Whatever the kine- 
matic variable chosen to discriminate between the two 
hypothesis, the huge background contamination would 
wash away any distinctive feature of the distributions in 
this channel. Requiring harder acceptance cuts, on the 
other hand, may help to clean the data sample, but at 
the cost of modifying the observed angular distributions 
which are used to compare the predictions of different 
spin hypotheses. 

Therefore, determining the spin of the new resonance 
studying the shape of the kinematic distributions of pho- 
tons would require a truly clamping technique to pick 
the signal events from the huge ensemble of background 
events without demanding too hard cuts. This is pre- 
cisely what has been suggested for the ZZ channel [3| 
by means of the s Weight technique [T^. In this chan- 
nel, signal to background ratios are also very small, and 
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an optimal separation of signal and background angular 
distributions was achieved by using sPlots [17[ ■ The tech- 
nique has also been used in several experimental studies 
at BaBar, Tevatron, LHCb, and the LHC [Hj|. 

We show that, weighting diphoton events according to 
their well understood invariant mass distributions using 
the sW eight technique, a high confidence level discrimi- 
nation is possible between a scalar J p = + and a tensor 
J p = 2 + resonance with minimal couplings to SM parti- 
cles. This is done by measuring a center-edge asymme- 
try of the polar angle of the photons in the resonance 
rest frame. At the time when the 77 excess reaches a 5a 
significance level in a SM-like Higgs boson signal against 
backgrounds, a spin-0 or spin-2 will be favored to an even 
higher confidence level. We also show that the asymme- 
try is rather insensitive to systematic errors and can give 
even better results than a log likelihood ratio analysis. 

The paper is organized as follows: in section II we 
present the expected parton and detector level distribu- 
tion for the scattering polar angle of diphotons in spin- 
and spin-2 decays. In section III we summarize the 
sWeight technique. The section IV is devoted to compu- 
tation of the statistical significances based on the asym- 
metry measure and the log likelihood ratio statistic in- 
cluding systematic uncertainties. Results are presented 
in section V and the section VI contains our conclusions. 



II. POLAR ANGLE DISTRIBUTION OF 
DIPHOTONS FROM SPIN-0 AND SPIN-2 
RESONANCE DECAYS 

In this section we present the parton and 
hadron/detector level distributions for the scatter- 
ing production angle 6* in the center-of-mass (CM) 
system of the photon pair at the LHC. 



A. Parton level distributions 

We concentrate on the main production mode of a 
Higgs boson at the LHC, the gluon fusion process [20| . 
with subsequent decay to a pair of photons 



5(Ai)3(A^)^Vo,V 2 ^ 7 (A 1 ) 7 (A 2 ) 



(1) 



Here A1.2 = ±1 denote the photon helicities, A^ 2 = ±1 
the gluon helicities, and Xq,X2 a spin-0 + and spin-2 + 
resonance, respectively. 

In the resonance rest frame, the theoretical parton level 
^-distribution for a spin-0 resonance decaying to two 
photons is independent of the 6* angle 
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where P(9*) denotes the probability density function 
(pdf) of the 8* variable, and d J mm ,(Q*) are the Wigner 
functions 12611 . 



The amplitude for the decay of a spin-2 state of mass 
ran into two photons can be conveniently written as (TBI . 



M(X 2 ^77) = jOi 



i=2 



(3) 



where 0\ is a dimension-2 operator, while Oi, i — 
2,3,4,5, are dimension-4 operators, and A an energy 
scale where new physics is expected to appear. The min- 
imal coupling of a spin-2 state to photons corresponds 
to set gi 7^ and g^o = 0. The explicit expressions 
of these operators can be found in the appendix and in 
Refs. dEEl as well. 

The helicity amplitudes of the decay X 2 — > 7 (Ai) 7 (A2) 
can be expressed in terms of the gi couplings of Equa- 
tion © as 
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Here, the c, are combinations of gi couplings: c± = 2g\ + 
g 2 n, c 2 = -gi/2 + K(g 3 /2 + g4Yc A = (-g 2 /2 + g 3 + 2g 4 )K, 



C5 = 2g$K, and K 



; /A 2 



The amplitudes of Equation(|4|) take into account the 
most general couplings between a spin-2 state and mass- 
less vectors. In this point, we specialize our analysis to 
the minimal couplings between the spin-2 resonance and 
photons and gluons. It has been suggested that some 
non-minimal couplings configurations are already disfa- 
vored by the current data [28[. Note that setting g$ 7^ 
and <7i^5 = corresponds to the pure parity-odd tensor 
state 2~. 

If (74 or <75 are not vanishing, for instance, the spin-0 
vs spin-2 discrimination is an easier task in the 77 and 
ZZ channels if we explore the angular distributions of 
photons and charged leptons, respectively jl6j. In this 
respect, discriminate between a + and a 2 + hypothesis 
is not the easiest conceivable scenario. Yet, through a 
careful tuning of the gi couplings, is possible to approxi- 
mately mimic the cos(#*) distribution of a spin-0 + state 
decaying to photons. 

Our main goal, in this work, is to show that the 
sW eight technique, for which we give a brief description 
in the next section, is a powerful tool to study the spin 
of the new resonance in the 77 channel. We choose the 
minimal couplings scenario for a resonant spin-2 particle 
as a concrete example. 

A common feature of theoretically well motivated 
extra-dimensions models, as the large extra dimensions 
scenario of the ADD model [6] and the warped extra 
dimension Randall- Sundrum model Q, is the existence 
of spin-2 Kaluza-Klein gravitons. The collider effects 



3 



of large extra dimensions of the ADD scenario are ex- 
pected to be observable as towers of almost degener- 
ated KK-gravitons [2l|. Despite it might be possible to 
study the angular dependence of photons coupled to such 
KK-towers, the sWeight analysis requires a well under- 
stood 77 invariant mass distribution to work. This is 
more easily accomplished for narrow resonances^ as those 
expected from the RS warped extra-dimension scenario 
with large mass splittings among the graviton states [2l[ . 

The Lagrangian for the interaction between SM parti- 
cles and a massive RS KK-graviton in 4 space-time di- 
mensions is given by 

where G^ v is the field of the I s * massive KK-graviton 
mode, T^ v the SM stress-energy tensor, and A the scale 
up to the effective theory is valid and that can be identi- 
fied with the energy scale in Equation (J3J. This interac- 
tion Lagrangian involves only minimal couplings between 
the graviton and SM fields [23l-[25j . 

With minimal couplings, c-i — — 4ci and all the other 
Cj vanish. As the result of an accidental cancellation, 

M(++) = M( ) = 0, and the decay width of the 

KK-graviton to photons results in the following formula 

Ai,A 2 m 

Ai,A 2 m 

= ^iM(+-)i 2 (i4 2 (ni 2 +i4- 2 (^)r) 

(6) 

The Equation (j6]) reproduces the well known result [Tsl . 
IM H3 for the normalized P 2 (9*) 

P 2 (0*) = ^ (1 + 6 cos 2 (r ) + cos 4 (r)) (7) 

Despite it is not favored by the current data in the 
Higgs search and by the stringent constraints from reso- 
nant production and decay to WW bosons we take 
the Randall-Sundrum model, where a narrow spin-2 res- 
onance couples minimally to the SM particles, as a straw 
man scenario against with the spin-0 hypothesis can be 
tested. This is a mere convenience. We could also have 
used a similar model with identical minimal couplings 
with no reference to any known model. In particular, we 
normalize all the spin-2 cross sections to the SM Higgs 
boson ones, leaving the cos(#*) distribution as the only 
discriminant. In the following discussions we will refer 
also to the spin-2 resonance as a KK-graviton. 



1 Narrow resonances of Regge excitations with minimal couplings 
to SM particles are also expected in open string theories \'2'M . 



P(0*) 




FIG. 1. The normalized pdf of the angular variable cos(#*). 
The solid blue line represents the parton level predictions for 
a spin-2 KK-graviton, and the flat dashed line a spin-0 distri- 
bution. The dotted line shows the distorted distribution after 
including detector effects for hadron level events for the spin- 
0, and the dot-dashed line the distorted spin-2 KK-graviton 
case. 

We show in the Figure (JXJ) the parton level distribu- 
tions of the cos(#*) variable for a Higgs boson and a 
KK-graviton with minimal couplings according to Equa- 
tions © and (J7J. These distributions are considerably 
distorted by detector acceptances as we discuss in the 
next section. 



B. Distorted shapes including detector effects 

The parton level distributions are considerably dis- 
torted after taking into account initial-state radiation 
and detector resolution effects. The geometric restric- 
tions of the detectors, reflected in the finite rapidity cov- 
erage, play the role of boundary conditions on the P(0* ) 
functions such that P(6* = 9*,tt — 6*) = where 6* is 
the limiting angle from the rapidity cut. 

In order to simulate these effects in the shape of the 8* 
distributions, we generated a large number of events (typ- 
ically O(10 5 )) to a SM Higgs boson, to represent the + 
case; a massive first excited state of a RS KK-graviton, 
representing the 2 + alternative; and the irreducible and 
main reducible backgrounds for 77 production at the 
LHC, using MadGraph5 [29j with CTEQ6L1 parton distri- 
bution functions [3(j|. The parton level events were, then, 
interfaced to Pythia 6.4 [3l| to simulate hadronization 
and jet clustering, and to PGS to a fast detector simula- 
tion with ATLAS settings [l|. 

The hard matrix element contributions and soft radi- 
ation from parton showers were consistently merged in 
the MLM scheme [3^|. The merging is crucial for a reli- 
able graviton simulation because there are tree level dia- 
grams contributing to the inclusive process pp — > 77 + X 
through qqgG, gggG and ggggG vertices. Despite these 
are not leading contributions, their impact on the shape 
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of distributions have to be taken into account properly. 
Reducible backgrounds as jj and jj also demand this 
kind of careful treatment. 

We show in the Figure (fTJ) the cos(#*) distribution for 
photons pairs decay of a spin-0 and a spin-2 particle at 
the 8 TeV LHC. Without detector effects, the spin-0 state 
decays isotropically in the CM system of the photon pair, 
resulting in a flat cos(0*) distribution, while the spin- 
2 KK-graviton exhibits the quartic polynomial behavior 
of Equation ([7]). After a full simulation, on the other 
hand, the distorted shapes of the distributions get more 
similar to each other. Given this more realistic situation, 
it is crucial to evaluate the real potential of the LHC 
to discriminate between the two cases by simulating the 
statistical fluctuations in a real experimental. This can 
be done performing a sufficiently large number of pseudo- 
experiments based on Monte Carlo (MC) simulations. 

We have imposed the following selection cuts based on 
the ATLAS analysis Q in the generation of the signal 
and background MC events for the 7 and 8 TeV energy 
regimes at the LHC 



E Tl > 40 GeV , E T2 > 30 GeV 
|«7x,a] < 2.37 , AR 1<2 > 0.4 
100 GeV < Mi 2 < 160 GeV 



(8) 



and excluding the calorimeter barrel/end-cap transition 
region 1.37 < |»?i,2| < 1-52. The leading(sub-leading) 
identified photon is required to have Et > 40(30) GeV 
inside the calorimeter barrel and within the 100-160 GeV 
invariant mass window. The same criteria was ap- 
plied when jets or charged leptons are misidentified as 
photons. 

The SM Higgs boson production cross section from 
gluon fusion were normalized by the improved NNLO- 
NNLL QCD calculation [H at m H = 125 GeV at the 7 
and 8 TeV LHC. We have also included a 95% identifica- 
tion efficiency per photon. The Higgs boson decay width 
was calculated at NLO QCD+EW with HDECAY As 
a final result, around 25(20) Higgs boson events/ fbV 1 
are expected for the chosen mass at the 8(7) TeV LHC, 
which are similar to the expected numbers from the ex- 
perimental collaborations [l|, 0] ■ 

As we are considering the RS model just as a conve- 
nient example of an alternative spin-2 scenario against 
with we test the spin-0 hypothesis, the gravitons are as- 
signed the same mass and width of the Higgs boson, and 
the cross sections for the production and decay of a reso- 
nant graviton is normalized by the SM Higgs rates. This 
is the most conservative scenario for model discrimina- 
tion. 

The MC background events were normalized by the 
experimental number of events quoted by ATLAS |l| for 
5.8(4.9) ftT 1 at the 8(7) TeV LHC for the selection crite- 
ria of Equation ((SJ, respecting the measured composition 
of 74%, 22%, 3% and 1% for the 77, 7 j, jj and Drell- 
Yan contributions for a total of 35002(23619) background 
events at the 8(7) TeV LHC. 



III. THE SWEIGHT TECHNIQUE 

For the selection cuts adopted by the experimental col- 
laborations, after taking into account all the efficiency 
factors, around 3 signal 77 events from a Higgs boson de- 
cay are expected for each 1000 background events at the 
7 and 8 TeV LHC 0, Q in the mass range 100-160 GeV. 
Measuring the shape of any kinematic distribution thus 
requires an optimal selection technique. 

Similar signal to background (S/B) ratios are en- 
countered, for example, in B-physics experiments [l8T ] 
where an excellent signal versus background separation 
is achieved by means of the s Weight technique 17]. The 
method was employed recently to discriminate between 
spins and parities of "Higgs-alikes" using four charged 
lepton events from ZZ decays [3| also for very small 
signal to background ratios. We give here just a brief 
description of the method. 



A. Brief description of the formalism 

The s Weight formalism relies on an unbinned extended 
maximum Likelihood approach to a data sample in which 
signal and background events are mixed. If N s signal 
events and N b background events (which can be the out- 
come of the composition of a number of background com- 
ponents) are expected on average for a given integrated 
luminosity, the log likelihood for a given data sample of 
N e events is given by 

C = In [NJ s (x k ) + N b f b (x k )} -N s ~N b (9) 
fc=i 

where f s (%k) and fb(%k) are the probability density func- 
tions of a discriminating variable x evaluated at the k th 
data event. These pdfs are obtained from MC simulation 
for signal and backgrounds and must be well understood 
in order the method to be reliable. 

Given the N s and N b that maximizes £, we compute 
the sW eights for signal (n = s) and background (n = b) 
species for the k th data event as 



tut 1 \ Vnsfs(Xk) + Vnbfb(xk) 
iW n (Xk) = = 

NJ s (x k ) + N b f b (x k ) 



(10) 



The covariance matrix V is obtained by inverting the 
matrix V~/ = -d 2 £/dN n dN l which can be computed 
by numerical means. 

After the computation of the sW eights, we can popu- 
late histograms of any distribution of interest y, that is 
not correlated with the discriminating variable x, accord- 
ing to 



Vn = X! sW n (xk) 

\vh-Vn\<Sy/2 



(11) 



where y n is the central value of a given bin of width 
5y, and y k the value of the y variable for the k th event. 
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The histograms constructed from this prescription are 
called sPlots and none previous knowledge of the target 
distribution y is assumed. 

The sPlots are proved to bear important properties 
upon which stands the power of the method. First of 
all, they converge asymptotically to the true binned dis- 
tributions and reproduce, on average, the finite statis- 
tics histograms. Summing up the events from signal and 
background sPlots recovers the number of events in the 
data sample in a composition given by the signal and 
background number of events provided by the maximum 
likelihood fit, i.e, N s + Nb — N e , while the statistical 
uncertainty on each j/-bin can easily computed as 



°(yn) = J E ° W %M ( 12 ) 

V \Vk-Vn\<Sy/2 

Moreover, the sPlots can be merged on an event-by- 
event basis in a straightforward way allowing us to com- 
bine analyses from different experiments and different 
runs, for example. The normalizations and error bars 
of the merged sPlots are guaranteed to reproduce the 
correct ones asymptotically. 

A very important requirement for the s Weight tech- 
nique to work is a reliable parametrization of the dis- 
criminating distribution. It means we must choose a 
well understood variable both for signal and background 
events. In our case, the natural best choice is the 77 in- 
variant mass distribution Mi. 2. The LHC collaborations 
usually get good parametrizations using a Crystal Ball 
function or combinations ofgaussian distributions to fit 
a resonant M Xi 2 structure 00]. For the backgrounds, 
usual parametrizations are provided by polynomials and 
simple exponential functions 0, • 

We found that our MC signal events are well described 
by 6 parameters from the combination of two gaussian 
distributions: the means, widths, and normalizations. 
As in the experimental situation, the total background 
is well parametrized, in the invariant mass region con- 
sidered here, by a simple exponential distribution with 
two parameters, one for the shape and the other for the 
normalization of the distribution. We show in Figure ([2]) 
the MC points for our simulated events and the fitting 
curves for the resonant scalar boson invariant mass and 
for the total background. Note that the maximum of the 
resonance occurs for a somewhat smaller value of the in- 
variant mass compared to the scalar mass of 125 GeV. 
This is the effect of the photons energy loss inside the 
detector which can be compensated by the experimental 
collaborations using a proper calibration for the photon 
energy. 

It is also important to check that the discriminating 
and the target variables are not correlated. We show 
in Figure © density plots in the M 77 vs cos(0*) plane 
for our simulated events and the joint probability func- 
tion P(M 77 ,0*) = P(M 77 ) x P(9*) supposing indepen- 
dent variables. The independence can be further corrobo- 
rated by computing the covariance between the variables, 



Events 




FIG. 2. The points show the MC events for an 125 GeV scalar 
resonance at the upper panel and for the total 77 backgrounds 
at the lower panel. The red curves are fitted from the MC 
points as a sum of two gaussian pdfs and an simple exponen- 
tial pdf for the signal and the background M T7 distributions, 
respectively. 



which, in this case, is 0.07 for the spin-0 and 0.04 for the 
spin-2 sample corresponding to the figure. 



B. sPlots of the scattering polar angle distributions 

From a large number of MC events generated, we ran- 
domly selected N e pseudo-experiments in the statistical 
bootstrap sense, putting the events back into the sample 
in order they can be selected again. We verified that this 
procedure is equivalent to obtain random variates from 
the empirical distributions of the simulated samples. We 
show in the Figure (@]) the cos(#*) sPlots for the SM Higgs 
boson and the spin-2 KK-graviton. The figure also shows 
the histogram for a pseudo-experiment and the theoreti- 
cal distribution as given by the MC simulation. 

It is truly amazing how well the sPlots reproduce 
the pseudo-experiments represented by the histograms 
in Figure (J4| and the statistical errors associated to each 
bin. We superimpose the theory predictions including 
detector effects from a large number of MC events. The 
agreement between the theoretical curve and sPlots is 
very good. We also checked that the background ex- 
pectations are very well described by the corresponding 
sPlots. 

We present in the next section the statistical signifi- 
cances for spin-0 vs spin-2 model discrimination. 
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Cos(tf') 



Cos(fl') 



FIG. 3. Upper plots: the empirical joint distribution 
P (M 77 ,#*) of generated MC events for a spin-0 (right panel) 
and a spin-2 resonance (left panel). Lower plots: the corre- 
sponding theoretical pdfs assuming that M 77 and cos(#*) are 
statistically independent variables. 



IV. STATISTICAL SIGNIFICANCES FOR THE 
SPIN DISCRIMINATION 

Given the optimal separation between signals and 
backgrounds, we are in the position to construct a statis- 
tic to perform the hypotheses tests in order to project 
the necessary amount of data to discriminate between a 
scalar boson and a KK-graviton alternative for a fixed 
confidence level (CL). 

The log likelihood ratio (LLR) is the best statistic to 
perform parametric tests and has been successfully used 
in recent works for the Higgs boson case m ZZ 

and WW decays. Suppose that /z is the expected num- 
ber of spin-0 events in a given bin i of an sPlot histogram 
of cos(#*) with Nbins > and the corresponding number for 
a spin-2 distribution is If the data are denoted by d 1 , 
the log likelihood ratio to discriminate between the spin 
hypotheses is defined as 



A 



N H „ S 



Mo — A*2 



(fin 



(13) 



With no free parameters, the LLR follows a gaussian 
distribution [35| whose mean scales with the integrated 
luminosity as L and the standard deviation as \L. For 
diphoton events, the scattering polar angle is the most 
useful discriminant between the alternative scenarios. 
Relying on just one distribution to construct the statistic, 
the LLR is shown to have similar discrimination power 




Cos(0*) 



o Cos(0*) 



FIG. 4. The cos(#*) distribution for a scalar (upper panel) 
and a KK-graviton (lower panel) including cuts, hadroniza- 
tion and detector effects. The histograms represent the out- 
come of a pseudo-experiment, the points with error bars are 
the sPlots, and the solid lines the theory prediction from a 
large number of generated MC events from MadGraph5. 



of a \ 2 test [36}. The LLR performs as better as more 
discriminating variables are used together taking into ac- 
count their mutual correlations LJ]. Moreover, as we will 
show, it is rather sensitive to systematic errors. 

Another simple and robust statistic, although less pow- 
erful than the LLR, is the Center-Edge asymmetry de- 
fined as 



.4 



cr(|cos(0*)| > 0.5) - ct(|cos(0*)| < 0.5) 
(t(|cos(0*)| > 0.5) + ct(|cos(0*)| < 0.5) 



(14) 



From the theory point of view, an asymmetry is a 
model prediction which is intrinsically related to the 
shape of a given variable. In our case, the variable reflects 
the actual spin quantum number of the resonance. 

A sample of asymmetries follows a gaussian distribu- 
tion with mean A and standard deviation a a, the mean 
of the asymmetry error [371 ] 



I- A 2 



N 



(15) 



where N is the total number of events for a given inte- 
grated luminosity L. 

Note that an asymmetry does not depend upon L, 
only its related statistical error. In fact, any system- 
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atic uncertainties affecting the signal yield as a multi- 
plicative factor cancel in the ratio. There are several 
errors of this type, including systematics on luminosity, 
selection efficiencies, theoretical errors from renormaliza- 
tion/factorization scale uncertainties, for example. The 
most dangerous types of errors for an asymmetry mea- 
sure are those related to bin migrations and errors that 
distort the shape of the distributions in general as the 
jet energy scale and parton distribution functions uncer- 
tainties, for example. Moreover, as it gathers all events 
in two larger sets - the Center bin |cos(0*)| < 0.5 and the 
Edge bin |cos(0*)| > 0.5 - the central asymmetry value 
is less sensitive to statistical fluctuations. 

On the other hand, if there is a contamination from 
background events, even when it is small, the asymme- 
tries may be washed away or biased to a common value 
which requires a larger number of events for a good dis- 
crimination. Kinematic cuts are usually employed to get 
a cleaner sample of signal events, but this time a sculpt- 
ing effect may distort the shape of the target variable 
and turns the predictions more similar. The sWeight 
technique circumvents these difficulties and allows us to 
fully explore the differences in the shapes of variables to 
obtain accurate asymmetry values. 

From the point of view of the computational burden 
necessary to give a reliable estimate of the statistical 
confidence level of the model discrimination, a relatively 
small number of pseudo-experiments is necessary to build 
the probability distribution function of the asymmetries. 
We verified that 10 3 -10 4 events are sufficient to obtain 
the Gaussian parameters of the A distribution with a 
very good precision. 

Once the asymmetries for both alternatives have been 
calculated from their corresponding sPlots, and as we 
know that P (data\A) and P 2 (data\A), the pdf of the 
spin-0 and spin-2 asymmetry^, respectively, are normally 
distributed, the statistical significance of the hypothesis 
test can be easily computed 

Z{o) = ^- 1 {l-p)= \ A °Z M (16) 

If A c is the chosen critical value for the test, the p- 
value p is the probability of a type-I error (a) given by 
P = J A P 2 {data\A)(x)dx, and represented by the yellow 

shaded area (pi) under P2(data\A) for A c — Aq in Fig- 
ure (|S|). The inverse of the cumulative distribution func- 
tion of the standard normal, calculated at 1 — p, 
gives the statistical confidence level Z(a) of the test in 
units of standard deviations of a standard gaussian dis- 
tribution. 

If we have good reasons to believe that the 125 GeV 
resonance is indeed a scalar particle, the critical value is 
set to the spin-0 mean or median prediction, A c = Aq 



2 Here, data denotes the generated pseudo-experiments. 
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FIG. 5. The probability density function of the center-edge 
asymmetry A extracted from the fitting to a sample of 10 4 
pseudo-experiments for the spin-0 and spin-2 distributions. 
The colored areas represent the type-I and type-II errors for 
different critical values. 



and p = a = p\ is the probability to accept the spin-0 
hypothesis when, in fact, it is wrong. If we require the p- 
value to be very small, we loose control on the probability 
of rejecting the scalar hypothesis when it is correct one 
- this is the type-II error (/?), which can be calculated as 

/3 = Po(data\A)(x)dx — 0.5. This is a prejudicious 
choice based on a priori belief in the scalar option. 

A choice with no preferences would be taking a crit- 
ical value for which the type-I and type-II errors were 
equal [35]. That is it, if no prior information would 
be available at the time of the spins discrimination, 
there would not be any reason to favor a particular 
choice. In this case, the p- value increases by the magenta 
shaded area (p 2 ) of the Figure (0 while /3 decreases to 
P3 +P4 < 0.5. If the pdfs are not too asymmetric, a good 
approximation to the critical value is A c = (Aq + A 2 )/2 
and the confidence level in any of the two hypotheses is 
smaller by a factor of 2. 

Under the light of the current information about the 
new boson, we believe that the prejudicious choice of a 
spin-0 particle hypothesis is not conceptually misleading. 
Any serious alternative spin-2 candidate should show up 
with similar branching fractions to pairs of Z, W, 7, and 
SM fermions, beside the right production rate. 



A. Incorporating systematic errors 

Once we have reliably estimated the signal and back- 
ground pdfs, it is possible to incorporate sources of sys- 
tematic errors in the computation of the statistical con- 
fidence level in an approximate fashion. It is beyond the 
scope of this work to estimate all sources of systematic 
uncertainties that could has an effect on the calculation 
of asymmetries, which is a task aimed to the experimen- 
tal collaborations. Yet, we provide a simplified analysis 
to show that we should not expect any large deviations 
due systematic uncertainties on the asymmetry measures, 
but the tests based on the LLR are somewhat degraded. 
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Suppose the pdfs depend on a number of nuisance pa- 
rameters {ei, • • • , e n } that describe uncertainties from n 
sources of systematic errors. To embody these effects in 
the attainment of the confidence belt, we could marginal- 
ize the nuisance parameters in a Bayesian approach, con- 
voluting the asymmetry pdf with a prior density 7r(e) for 
each systematic uncertainty. For example, for a single 
nuisance parameter normally distributed around e with 
error a £ [381 ] , the asymmetry probability density function 
is given by 

(e'-e) 2 

P(data\A,e) = / P(data\A,e') =ds' (17) 

Jo ct £ V27t 

In fact, a hybrid technique can be used [39j . A frequentist 
approach is possible in the sense that the effects of these 
systematic errors could be estimated from an ensemble 
of pseudo-experiments each of them with a randomly se- 
lected value of e. 

As we are interested in signal events /i s as simulated 
from theory, unlike the backgrounds, we integrate over 
nuisance parameters of the signal pdfs only. For the cor- 
related errors that we choose to illustrate the impact of 
systematic errors - the uncertainty on the integrated lu- 
minosity (el) and the selection efficiency (e e //) _ it is 
easier to just adopt the frequentist approach and multiply 
the number of signal events as fi' a — /i s (l + £l)(1 + £ e //) 
for each generated pseudo-experiment. We adopt the un- 
certainties quoted in the ATLAS experiment |l|: a rel- 
ative uncertainty of 3.6% in the luminosity and 11% in 
the selection efficiency. 

In fact, none of these correlated sources of systematic 
errors changed the value of the original asymmetries no- 
ticeably nor their probability density functions, confirm- 
ing the expectation that this kind of uncertainty cancel 
in the ratio. On the other hand, the probability density 
functions of the LLR statistic broadens, increasing the p- 
values. We compare the performance of the center-edge 
asymmetry and the LLR statistic in the next section. 



V. RESULTS AND DISCUSSION 

We present in Table I the values of the asymmetries for 
the spin-0 and the spin-2 resonances computed from the 
parton level MC events, the distorted detector level MC 
events, and from the corresponding sPlots with marginal- 
ized correlated systematic errors as discussed in the pre- 
vious section. 

An approximate shift of 0.3 in the asymmetry values 
is observed from parton level to detector level and sPlots 
both for spin-0 and spin-2 scenarios. This is the con- 
sequence of the migration of events from the edges to 
the center of the cos(#*) distributions due the geometry 
restriction of the detectors. 

As we pointed out earlier, the sPlots from any number 
of channels can be added straightforwardly. We can take 





Parton level 


Detector level 


sPlots 


spin-0 + 





0.324 


0.325 ± 0.042 


spin-2 + 


-0.207 


0.005 


0.005 ± 0.044 



TABLE I. The values of the center-edge asymmetries for a 
spin-0 and a spin-2 resonance at parton level, detector level, 
and from sPlots with 10 3 pseudo-experiments. 



advantage of this and combine all the available data pro- 
duced in ATLAS and CMS collaborations and from the 

7 and 8 TeV runs. 

Figure ([6]) shows the main results of this work. The 
statistical significance of the spin-0 boson versus spin-2 
KK-graviton discrimination, based on the asymmetries 
of their cos(6**) distributions, are shown for the 8 TeV 
ATLAS run solely, the 7 and 8 TeV runs, and for the 
entire data set from ATLAS and CMS for both runs. 

The upper panel of Figure © show the statistical sig- 
nificance in the spin-0 hypothesis against the spin-2 hy- 
pothesis. The solid line represents Z(a) as a function 
of the integrated luminosity at the 8 TeV LHC for the 
ATLAS experiment only. A 5a discrimination is possible 
with ~ 15 fb _1 . Adding the 7 TeV run increases the sig- 
nificance to 5.5a with the same amount of data from the 

8 TeV run. 

For this luminosity, however, a 5a signal to background 
discrimination cannot be reached relying only on the 77 
channel [l], in the case of a SM Higgs boson. It should 
be pointed out, however, that the present observed signif- 
icance is 4.5er which characterizes an 1.8 signal strength 
in the H — > 77 channel. If this enhancement persists, 
with the current signal strength, a 5a spin discrimination 
of the spin-0 hypothesis against the spin-2 with minimal 
couplings should already be feasible. 

If we conservatively assume that the current excess is 
just a momentary statistical fluctuation, the 5a C.L. in 
the signal vs background hypothesis will be reached for 
around 40 fb _1 of data, or some 20 fb _1 per experi- 
ment after ATLAS and CMS combination for a SM signal 
strength [l|, Q , but we still can combine the ATLAS and 
CMS data for the 7 and 8 TeV runs for the spin diag- 
nosis. In this case, the red line in the upper panel of 
Figure ^ represents the combined result with a lumi- 
nosity L for each experiment. For L — 20 fb _1 a very 
high statistical C.L. of almost 9a in the scalar hypoth- 
esis can be reached. Note that with the current data, 
a combined analysis is already able to discriminate be- 
tween both spin alternatives. Of course, as the signal 
excess in the diphoton channel can not be attributed to 
a new resonance at the required statistical level, for the 
LHC standards, in principle, it makes no sense to talk 
about which kind of particle is being produced. 

In spite of being usually a more powerful statistic, the 
log likelihood ratio is much more sensitive to systematic 
uncertainties. To evaluate the impact of these uncer- 
tainties, we adopted a larger variation in a single corre- 
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FIG. 6. The expected significance for spin-0 versus spin-2 
KK-graviton hypothesis, based on the asymmetry measure, 
at the 8 TeV LHC for the ATLAS experiment (lower solid 
lines), 7 and 8 TeV combined data for ATLAS (middle solid 
lines), and combining all available data from the 7 and 8 TeV 
runs for ATLAS and CMS (upper solid lines). The dashed 
lines at the upper panel represent the discrimination power 
of a LLR statistic with systematic uncertainties ranging from 
(upper line) to 30% (lower line) in the normalization of 
the signal strength. The lower panel depicts the significance 
calculated in the non prejudicious approach where a — f3 as 
explained in the text. The 90% and 99% CL lines are also 
shown. 



lated systematic error in the signal prediction, varying 
its magnitude from to 30%, and distributed normally 
as discussed in the previous section. The shaded yellow 
band, between the dashed lines in the upper panel of Fig- 
ure (J6j) , shows the loss in the discrimination power in the 
LLR statistic. 

With no systematic errors, the LLR performs around 
10% better than the asymmetry for ATLAS 8 TeV run, 



which corresponds to the upper dashed line. As the 
uncertainty increases, the LLR pdfs broaden and there 
is a loss in the discerning power of the test. Eventu- 
ally, the LLR statistic is expected to perform worse than 
the asymmetry measure, the solid line inside the shaded 
region. The lower dashed line depicts the significance 
achieved using the LLR and a larger systematic uncer- 
tainty of 30% in the normalization of the signal yield. 
Similar behavior was found for more data from combina- 
tions of runs and experiments. 

By its turn, the variation in the discrimination power 
of the asymmetries is less than 1% for the same 0-30% 
variation in the magnitude of the single systematic un- 
certainty. 

The lower panel of Figure (j6)) shows the significances 
by choosing a critical value for which the type-I and type- 
II errors are nearly equal in the hypothesis test. This is 
the no prejudice scenario discussed in section IV. Here, 4 
times more luminosity is necessary to reach the same CL. 
compared to the prejudicious scenario. Combining all 
the available data from the two experiments will enable 
a 5<7 discrimination with 25 fb _1 per experiment and 
around 4.5a for 20 fb _1 . On the other hand, a 99% CL 
discrimination is well within the LHC possibilities for the 
8 TeV run. 



VI. CONCLUSIONS 

The picture delineated by the current data on the new 
125 GeV boson discovered at the LHC already points to 
a Higgs boson. Yet, its identity, which must be uniquely 
assigned by the set of its quantum numbers, has not been 
determined - this identification is a crucial step to con- 
firm the new particle as a Higgs boson, be it SM or not. 
Discriminating its spin number will break the boson de- 
generacy into a scalar or a tensor state, once the vector 
alternative has already been discarded from the observed 
decays into photons pairs. 

The 77 channel is even more useful in the spin deter- 
mination. We showed that by measuring a center-edge 
asymmetry in the sPlots of the scattering polar angle 
of the diphotons, is sufficient to discriminate between a 
scalar J p — + and a tensor J p = 2 + resonance min- 
imally coupled to photons and gluons, using the KK- 
gravitons of the RS model as a straw man scenario. By 
the way, although there is a large number of possibili- 
ties of non minimal couplings, the KK-graviton inspired 
couplings are not the easiest conceivable scenario against 
whom the scalar hypothesis could be tested, from the 
point of view of the amount of data required to a high 
discrimination level. 

The asymmetries were also shown to be rather insensi- 
tive to systematic uncertainties, contrary to the usually 
more powerful log likelihood ratio statistic, and performs 
even better than the LLR for errors of moderate magni- 
tude. 

The sW eight technique proved to be essential in this 
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good performance, once it allows one to optimally sepa- 
rate signal and background events, which ultimately lead 
to a maximum discerning power based on the shapes of 
the distributions, being only limited by detector effects. 
The sPlots can be combined in a straightforward way and 
all the information accumulated in the 7 and 8 TeV runs 
from both ATLAS and CMS can be readily used in the 
analysis. 

As a result of the application of this technique, we 
found that by the time the 77 channel alone favors the 
signal hypothesis, with a SM strength, against the back- 
ground one at a 5ct confidence level, the scalar versus 
graviton hypotheses will be discriminated to a higher sig- 
nificance level of ~ 9er. A 5<r spin discrimination will 
possible, though, with less data, and an integrated lumi- 
nosity of approximately 10 ftT 1 for ATLAS or CMS is 
sufficient to discriminate between the alternatives. Also, 
if a large signal strength in the 77 channel, comparable to 
the current one, is really confirmed, the spin-0 hypothe- 
sis is already favored to ~ 5ct combining the ATLAS and 
CMS data. 

Keeping both the probability to accept the wrong hy- 
pothesis and the probability to reject the right one under 
control, in a less prejudicious approach, requires more 
data for a 5ct C.L. discrimination. We estimate that 
25 fb _1 for each experiment would be necessary in this 
case. Relying on 20 fb _1 per experiment a ~ 4er signifi- 
cance is still possible. 

VII. APPENDIX 

We present here the general amplitude for the decay 
of a spin- 2 particle into a pair of photons as given by 
Equation (J3J. The case for a decay into a massive pair 
of gauge bosons can be found in Refs. [ill (3 • 

The operators Oi , i = 1, • • • , 5 are written in terms of 
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